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-H- , Abstract. We introduce kernel estimators for the semicircle law. In this 

first part of our general theory on the estimators, we prove the consistency 

and conduct simulation study to show the performance of the estimators. 

pH ' We also point out that Wigner's semicircle law for our new estimators and 

O |. the classical empirical spectral distributions is still true when the elements 

L of Wigner matrices don't have finite variances but are in the domain of 

.^ , attraction of normal law. 



1. Introduction 
> 

Random matrix theory gained attention due to work by Wishart in 1928 in 
mathematical statistics. After that, the subject got prominence when Wigner 
began to describe the energy levels of a system in quantum mechanics by ran- 
dom matrices. Since then, mathematicians and physicists have been fascinated 
by random matrix theory. Regarding earlier work in this field one may refer 
to the book by Mehta [6] and the recent book by Bai and Silverstein [2] . 

Usually, one assumes that the matrix entries have finite variances. There is 
little work when the finite second moment condition doesn't hold. We refer 
to two papers, [3] and [3]. This paper is concerned with the Wigner matrix 
whose units may have infinite variances. 

Let X be a complex random variable with EX = 0. Write 

l(x) =E\X\ 2 I(\X\ <x). 
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Put b = inf{x > : l(x) > 0}. We suppose that l(x) is a slowly varying 
function as x — > oo. Define 

Jinf{x > 6 + 1 : nl(x) < x 2 } if E\X\ 2 = oo 
I ^/nE'lXl 2 otherwise. 

Then b n — > oo as n — > oo if S|X| 2 = oo, and YTj=iXj/b n — ^ N(0,1) if 
Xi, ■ ■ ■ ,X n are independent and identically distributed (i.i.d.) as X. (This is 
the reason why X is said to be in the domain of attraction of normal law.) 

The Wigner matrix in this paper is defined by 

(1.1) W n = b- 1 (X lJ ) nxn , 

where Xij = Xji, and X^, 1 < i < j < oo are i.i.d. complex random variables 
having the same disttribution as X. The diagonal elements Xa,i = 1,2, •• • 
are i.i.d. real random variables. 

The classical limit theorem regarding W„ concerns its empirical spectral 
distribution, defined by 

where //&, k — 1, • • • , p denote the eigenvalues of W„. 

Wigner [HI [12] found the semicircle law as the limit of F Wn whose distri- 
bution F(x) has a density 

f{x) = { h^^ ifM<2 

otherwise. 

Clearly we can use F Wn (x) to estimate F(x). However we can't make any 
inference on F(x) because there is no central limit theorem about (F Wn (x) — 
F(x)) when the matrix elements only have finite moments. Even the conver- 
gence rate of _F Wn to F is not clear as far as we know. This motivates us 
to find other approaches to understanding the limiting spectral distribution 
F(x). 

Another motivation of our current work is the local semicircle law. In [5] , 
the local semicircle law for Wigner matrices, which is a precise estimate for the 
deviation of the Stieltjes transform of the empirical spectral distribution from 
the semicircle law, is obtained under the sub-exponential moment conditions. 
One can estimate the density curve (local semicircle law) based on the observed 
Wigner matrix by the imaginary part of the Stieltjes transform of the empirical 
spectral distribution. So is it possible to provide some other estimators for the 
density curve? 
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This paper is part of a programme to estimate F(x), as well as f(x) by kernel 
estimators. In this paper we will prove the consistency of those estimators as 
a first step. We also prove that the semicircle law still holds if we only assume 
that £7|X| 2 /(|X| < x) is a slowly varying function of x as x — > oo (which is 
equivalent to the condition that X is in the domain of attraction of normal 
law). 

2. Methodology and Main Results 

Suppose that the observations X\, • • • , X n are i.i.d random variables with an 
unknown density function g(x) and F n (x) is the empirical distribution function 
determined by the sample. Then, a popular nonparametric estimate of g(x) is 

(2.1) /. W = JL± K £lZ) ^JkC-^wm, 

where the function K(y) is a Borel function and h = h(n) is the bandwidth 
which tends to as n — » oo. Obviously, f n (x) is again a probability density 
function and moreover, it inherits some smooth properties of K(x), provided 
that the kernel is taken as a probability density function. Under some regular 
conditions on the kernel, it is well-known that f n {x) — > g(x) in some sense 
(with probability one or in probability). There is a huge literature regarding 
this kind of estimate. For example one may refer to Rosenblatt [9], Parzen [8], 
Hall [7] and the book by Silverman |10j . 

Enlightened by (12. ip . we propose the following estimator f n (x) of f(x), given 
by 

(2.2) f n (x) = L^K{^±) = \J ' K{^JL)dF™*{y), 

(2.3) F n (x) = f f n {t)dt, 

where /Xj,i = 1, ■ ■ • ,p are eigenvalues of W n . It turns out that f n (x) and 
F n (x) are respective consistent estimators of f(x) and F(x) under some regular 
conditions. It is interesting to note that if 

(2.4) K(x) = tt- 1 (1 + x 2 )- 1 , 



then 



f n ( x ) = n- 1 ^ f{X-x- th)- 1 dF Wn (\). 



So the imaginary part of the Stieltjes transform is a special case of our kernel 
estimators. 
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Before we present our main results, we introduce some regular conditions on 
the kernel function. Throughout the paper, suppose that the kernel function 

K(x) satisfies 

(2.5) K(x)>0, jK(x)dx = l, 

(2.6) /!*•(,)! *<». 

Theorem 1. Let W n be defined by ( II. lj) . Suppose that K(x) satisfies {\2.5\) . 
Let h = h(n) be a sequence of positive constants tending to zero. Then, 

(2.7) \\F n - F\\ := sup \F n {x) - F(x)\ -> 

X 

almost surely as n — >■ oo. 

Theorem [JJ follows from the following result. 

Theorem 2. Let W n be defined by (II. ip . T/ien F Wn tends to £/ie semi- 
circlular law almost surely. 

In the literature, Theorem[2Jwas derived under the assumption that i?|X| 2 < 
oo. See [2J. In |3J, Ben Arous and Guiomet considered the spectrum of heavy 
tailed random matrices. The proved that the spectral distribution of X n = 
(Xjk) normalized by a suitable constant tends to a limit \x a when X is in the 
domain of attraction of a a stable law (0 < a < 2). fi a is also characterized 
as having infinite support and being absolute continuous with respect to the 
Lebesgue measure. However from our Theorem [2, we see that /12 is still the 
semicircle law. This is consistent with the result that \x a converges to the 
semicircle law [5] as a tends to 2 from below. 

The final result is about the convergence of the density estimator. Unlike 
Theorems [T] or [2J we need more moment conditions in order to show the con- 
vergence. 

Theorem 3. Let W„ be defined by (II. ip . Suppose that K(x) satisfies (12. 5p 
and (12. 6p . Let h = h(n) be a sequence of positive constants tending zero such 
that nh 2 — > 00. Assume that E\X U \ 3 < 00, _E|X 12 | 6 < 00. Then 

SU P \fn(x) — f(x)\ — > in probability. 

xe[-2,+2] 

3. Simulation Study 

In this section, we perform a simulation study to investigate the behavior of 
the kernel estimators of the semi-circle law. 
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We consider two different populations, exponential and Poisson distribu- 
tions. From each population, we generate two samples of size 50 x 51/2 and 
800 x 801/2 respectively. So we can form two symmetric random matrices 
(Xij) 50x50 and (Xy) 80 ox80o- The kernel is selected as 

K{x) = (2 7 r)- 1 / 2 e^ 2 / 2 , 

which is the standard normal density function. Here we don't choose the 
kernel as (j2.4p since it is well known that the imaginary part of the Stieltjes 
transform divided by n tends to the density function. The bandwidth is chosen 
as h = n~ 2 / 5 (n = 50,800). 

For (X t j) 50x50) the kernel density estimator is 

1 50 

^^E*((,-*)/5crn 

8=1 

where fa, % — 1, • • ■ , 50 are eigenvalues of 50" 1//2 (X i:/ ) 50X 5o. This curve is drawn 
by dotted lines in Figures 1-2. 

For (Xjj) 80 ox80o, the kernel density estimator is 

1 800 

— VKffx-u^/SOO" 2 / 5 ), 

800 x 800- 2 / 5 ^ VV P;/ ; ' 

8=1 

where fa, i = 1, • • • ,800 are eigenvalues of 800~ 1//2 (X i:; ) 8 oox80o- This curve is 
drawn by dashed lines in Figures 1-2. 

The density function of the semicircle law is drawn by solid lines in Figures 
1-2. Here in Figure 1, the distribution of Xu is 

(3.1) P(X U < x) = 1 - e- {x+1 \ x>-l. 

In Figure 2, the distribution is 

P(X n = k) = e- x /{k + 1)!, k = -1, 0, 1, • • • . 

From the two pictures, we see that the estimated curves fit the semicircle law 
very well. As n becomes large, the estimated curves become closer to the 
semicircle law. 

In Figure 3, for the exponential distribution, we compare the kernel distri- 
bution estimator 

/y -.50 
^ V^((x-/ii)/50- 2/5 )^, 
.« 50 x 200- 2 / 5 ^ VV ^ " ' ' 

which is plotted by dotted line, the empirical spectral distribution function 
50 _1 J2t=i I (fa — x ) "which is plotted by the dashed line, and the semicircle 
law distribution function by the solid line. We see that although the two 
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estimated curves are very close to the semicircle law, the kernel distribution 
estimator performs better. 

In Figure 4, based on the eigenvalues //i, • • • , //50 of W 50 , we compare the 
kernel distribution estimator (dotted line) F 50 (y) = f 50x50-2/5 Si=i ^(( 



[x- 



/ii)/50 2 ^ 5 )(ix, the empirical spectral distribution (dashed line) F W50 (y) = 
5CT 1 X)i=i H^i — v)i an d the semi-circle law distribution function. Generally 
speaking, F50 is closer to the semi-circle law than F WrM . 



(Place Figures 1-4 here.) 



4. Proof of Theorems 
Proof of Theorem [TJ The Stieltjes transform of F n (x) is 



p 

v-1 



/ (x — z) l fn(x) dx = (ph) x 2_] / (x — z) L K((x — ^j)/h) dx 
(4.1) = J J (\ + yh-z)- 1 K(y)dydF w -(X). 

where t e R. For arbitrary e > 0, we choose y® such that 

K(y) dy < e. 



\y\>yo 
So 



(4.2) If \(\ + yh-z)- 1 \K(y)dydF m "(\)<e/^z. 

J J\y\>yn 
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Also note that 

f f \(X + yh- z)- 1 - (A- z)- 1 \K(y)dydF w - 
J J\y\<yo 

= 11 u '.ff (y) J r dydF^jX) 
J J\v\<vo \A-z\\\ + yh-z\ 

(4.3) < hy /(Zz) 2 . 
Combining (14. Ij) -( l473|) . we have 

lim I {x-z)- 1 f n {x)dx- j \\-z)- l dF Wn {\) <2e/Sz. 

n->oo J J 

Since e is arbitrary, we can complete the proof. □ 

Proof of Theorem [2j The finite variance case is already treated in the 
literature. We only consider the case where _E|X| 2 = oo. If l[x) = EX 2 I(\X\ < 
x) is slowly varying as x — > oo, then 

(4.4) P(\X\ >x) =o(l(x)/x 2 ), E\X\I(\X\>x)=o(l(x)/x) 
as x — > oo. 

We prove the theorem by the following steps. 

Step 1. Removing the diagonal elements 

Let W n be the matrix derived from W n by replacing the diagonal elements 
with zero. We will show that the limiting spectral distributions of the two 
matrices are the same if one of them exists. 

Let N n = ^fe=i^(l^"fefcl — fr" )• Replace the diagonal elements of W„ by 
bn l XkkI{\Xkk\ ^ b n ) and denote the resulting matrix by W n . Then the Levy 
distance L(F W ™,F W ") satisfies 

n 

L 3 (F w„ 5 F w n) < n -i tr[( w n - W„) 2 ] < (nbiy^xiAlXkkl < b]/ 2 ) < b-\ 

fe=i 
The rank inequality gives 

||i?Wn jF W„|| < Nn / n . 

So it suffices to show that N n /n — > almost surely. Write p n = P(|Xn| > 
bn ) — > 0. From Bernstein inequality, for any e > 0, we have 

P{N n >en)<2 exp ( - (e - Pn ) 2 n 2 /[2np n + 2(e - p n )n]) < e" bn 

for some positive constant b. This completes the first-step proof. 
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In the following, we can assume that the diagonal elements of W n are zero. 

Step 2. Truncation 

Let W n = b~ 1 (xjkl(\xjk\ < b n )). By the rank inequality, 

|| F Wn - F W " || < n -1 rank(W n - W n ) 

(4.5) < 2U- 1 J2 J (M ^ b ")- 

i<i<fc<« 

By (IO|) . we have 

E^n- 1 J2 I(\x jk \>b n ^=o(nl(b n )/b 2 n )=o(l), 
i<i<fc<« 
and 

Var^n- 1 J2 J (M ^ 6 -)) ^ 2P (l X i2l > &«) = oin- 1 ). 

±<j<k<n 

Then , applying Berstein inequality, for all small e > and large n, we have 



(4.6) P^- 1 J2 J (l x ^l > &») > e) < 2 



e er \ 



l<i<fc<" 



which is summable. By (14. 5 j) and (14. 6p . it suffices to show that F w " converges 
to the semicircle law. 



Step 3. Normalization We consider the Levy distance between _F W ™ and 
^w n -Ew„ which tends to zero since 

< (O^^^fc/dx^i^M) 2 

Also noting that 

Var(X 12 /(|X 12 | < b n )) = l(b n ) - (EX 12 I(\X 12 \ < b n )f = l(b n )(l + o^ 1 )) 
by (I4.4p . we conclude that 

upW n -EW n _ pWnU _s. Q 

as n — > oo, where 

/ N-l/2/ 

W n = (riVax(X 12 I(\X 12 \ < b n ))\ (x jk I(\x jk \ < b n ) - Ex jk I(\x jk \ < b n 
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Step 4. Complete the proof. From the above three steps, we know that 
it remains to prove that F w " tends to the semicircle law. Let 

Y jk = [Var(X 12 /(|X 12 | < b n ))Y l/2 [x 3k I{\x ]k \ < b n ) - Ex jk I{\x jk \ < &„)]. 

Then for arbitrary 77 > 0, by (j4.4p and l(r]b n )/l(b n ) — > 1, 

E\Y 12 \ 2 I(\Y 12 \ >r/v^)^0 

as n — > oo. This implies that 

lim n~ 2 y^E\Y jk \ 2 I(\Y jk \ > rjy/E) ->■ 

as n — > oo. So we can conclude the proof by Theorem 2.9 in [2]. □ 

Proof of Theorem [31 Using integration by parts we obtain 

I ~ / K{^±)dF^ (t)-lJ K (^)dF(t) | 

= oj j 



as n — > oo, where the last step uses Theorem 1.1 in [TJ and ( 12. 6p . Finally, 



IJ K(^)dF(t) - f(x)\ 
f{x-t)-f{x))±K{i-)dt\ 
sup | / (f( x -t)-f(x))^Kd)dt\ 

=[-2,2] J|t|><5 v J h h 

l{x-t)-]{x))\K(^)dt\ 



sup 

xe[-2,2] </|t|<<5 



2 sup /(x) / |lT(y)|dy + sup suv\(f(x-t)-f(x)\ []-\K( 

xG[-2,2] J\v\>S/h a;6[-2,2l |t|<<5 J n> 



<2 sup /(x) / \K(y)\dy+ sup sup |(/(x - 1) - f{x)\ I i\K&\dt, 

2,2] J\y\>8/h as6[-2,2] |t|<<5 J n ft 

which goes to zero by noting that f(x) is continuous, choosing sufficiently small 
5 and then for such 8 letting n — > oo. Thus the proof is complete. □ 
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Figure 1: Spectral density curves for Wigner matrices 
n _1 / 2 (Xjj) nxn , Xij ~ exponential distribution 
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Figure 2: Spectral density curves for Wigner matrices 
n~ l ' 2 (Xij) nxn , Xij ~ Poisson distribution 
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Figure 3: Distribution curves for Wigner matrices 
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Figure 4: Distribution curves for Wigner matrices 
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